Distinctive Phonetic Feature (dpf) Based Phone Segmentation Using 2-stage Multilayer Neural Networks
نویسندگان
چکیده
Segmentation of speech into its corresponding phones has become very important issue in many speech processing areas such as speech recognition, speech analysis, speech synthesis, and speech database. In this paper, for accurate segmentation in speech recognition applications, we introduce Distinctive Phonetic Feature (DPF) based feature extraction using a two-stage MLN (Multi-Layer Neural Network) system consists of an MLNLF-DPF in the first stage and an MLNDyn in the second stage. The MLNLF-DPF maps continuous acoustic features, Local Feature (LF), onto discrete DPF patterns, while the MLNDyn constraints DPF context or dynamics in an utterance. The experiments are carried out using Japanese triphthong data. The proposed DPF based feature extractor provides good segmentation and high recognition rate with a reduced mixture-set of HMMs (Hidden Markov Models) by resolving co-articulation effect.
منابع مشابه
Distinctive phonetic feature (DPF) based phone segmentation using hybrid neural networks
Segmentation of speech into its corresponding phones has become very important issue in many speech processing areas such as speech recognition, speech analysis, speech synthesis, and speech database. In this paper, for accurate segmentation in speech recognition applications, we introduce Distinctive Phonetic Feature (DPF) based feature extraction using a twostage NN (Neural Networks) system c...
متن کاملPhoneme recognition based on hybrid neural networks with inhibition/enhancement of distinctive phonetic feature (DPF) trajectories
In this paper, we introduce a novel distinctive phonetic feature (DPF) extraction method that incorporates inhibition/enhancement functionalities by discriminating the DPF dynamic patterns of trajectories relevant or not. The trajectories of each DPF show a convex pattern when the DPF is relevant and a concave one when irrelevant. The proposed algorithm enhances convex type patterns and inhibit...
متن کاملCanonicalization of feature parameters for automatic speech recognition
Acoustic models (AMs) of an HMM-based classifier include various types of hidden variables such as gender type, speaking rate, and acoustic environment. If there exists a canonicalization process that reduces the influence of the hidden variables from the AMs, a robust automatic speech recognition (ASR) system can be realized. In this paper, we describe the configuration of a canonicalization p...
متن کاملEffects of Syllable Language Model on Distinctive Phonetic Features (DPFs) based Phoneme Recognition Performance
This paper presents a distinctive phonetic features (DPFs) based phoneme recognition method by incorporating syllable language models (LMs). The method comprises three stages. The first stage extracts three DPF vectors of 15 dimensions each from local features (LFs) of an input speech signal using three multilayer neural networks (MLNs). The second stage incorporates an Inhibition/Enhancement (...
متن کاملSelected Papers of the IEEE International Conference on Computer and Information Technology
This paper presents a distinctive phonetic features (DPFs) based phoneme recognition method by incorporating syllable language models (LMs). The method comprises three stages. The first stage extracts three DPF vectors of 15 dimensions each from local features (LFs) of an input speech signal using three multilayer neural networks (MLNs). The second stage incorporates an Inhibition/Enhancement (...
متن کامل